Discussion 311 Group 4

Introduction

During the research and distribution periods of the COVID-19 vaccines, social media-driven anti-vaccination campaigns have gained popularity, particularly among the Facebook1 and Instagram2 communities. We want to find whether there is a relationship between environmental and societal factors and vaccine adoption rates.

We theorize that urban, educated, and older populations are more likely to have higher rates of vaccine adoption than those populations with a higher proportion of rural residents, less education, or lower median age.

Background

Variables

Vaccine Data

  • one_dose_at_least_p: Percent people get at least one dose
  • completed_series: People who completed the series
  • completed_series_p: Percent people who completed the series

Urban and Rural Data

  • population: Total population of the county (All ages)
  • urban: Population living in an urban environment of the county
  • rural: Population living in a rural environment of the county

Education Data

  • pct_m_hs: Percent males estimated high school graduate or higher
  • pct_f_hs: Percent females estimated high school graduate or higher
  • pct_m_bach: Percent males estimated bachelor’s degree or higher
  • pct_f_bach: Percent females estimated Percent bachelor’s degree or higher

Median Age data

  • median_age: Median age in each county
  • county: County in Wisconsin

Note for Missing Data:

  • Although the education data is only for 25+, we estimate that the vaccination rate of those under 25 is insignificant and thus estimate the education population groups on the urban and rural population data.
  • There are some missing values in wi_zip.csv, wi_income.csv, wi_age.csv, but the proportion is statistically insignificant.

Data Collection

Vaccine Data

Source: Wisconsin Department of Health Services & The Wisconsin Immunization Registry (WIR)3

Every night at 11:30 pm we extract vaccine administration data from WIR that will be reported on the DHS website by 2:00 pm the following day. WIR is a live system and providers are constantly sending immunization data. Therefore, data will look different if it is extracted at a different time of day.

Vaccination administration: The cumulative number of COVID-19 vaccines administered. The Vaccine Distribution Summary includes all vaccine doses administered by Wisconsin vaccine providers. This includes doses administered to people who resided out-of-state, but who live, work, or study in Wisconsin and qualify for vaccination in-state. This provides information to track the allocation, distribution, and administration of vaccine by Wisconsin’s vaccinators. The COVID-19 Vaccines for Wisconsin Residents dashboard displays data for Wisconsin recipients of the vaccine. This information is used to inform vaccination coverage for the state.

Vaccine dose: One vaccine dose is one vaccine product (like a shot or a nasal spray). Some vaccines require two or more doses to protect you fully against a disease. Other vaccines give you enough protection to fight the disease after just one dose or shot.

Series completion: Many vaccines require multiple doses spaced out by weeks, months, or years to provide the best protection against a disease. Once someone receives the recommended number of doses within the correct timeframe, their series is considered complete.

Urban/Rural, Income, and Education Data

These datasets are from previous class assignments, particularly Homework 64.

Zip Code and County Information

The data were from ZipCodesToGo, combined with Longitude/Latitude Coordinates for visual maps5.

Wisconsin Median Age

The data was from a local news report which was based on the U.S. Census Bureau6.

Data Legitimacy, Trustworthiness, and Other Notes

  • Vaccine Data and Wisconsin Median Age are provided from WI Governmental agencies and are therefore fairly reputable.
  • We gathered much of the data, particular county demographics, from the homework, which prior was gathered from reliable sources.
  • Zip Code Information is readily available and is not subject to interpretation.
  • Median Age is originally sourced through the U.S. Census, which could have some underlying patterns, but is also fairly reputable.
  • The data of education and income are from the assignment, so they may not be as reflective of today’s data as the vaccination data.

Backgroud description & relation to topic

People with different levels of education may have different understandings of vaccines. One theory is that populations with limited education may think of vaccines, especially newly developed vaccines as particularly harmful or not well-studied; whereas, populations with more education might better understand the rigorous safety standards and trials the vaccine has been scrutinized by.

Concerning the urbanization of the environment, differences in residential densities could affect accessibility and/or the speed of distribution.

At the end of the study, we will have a better understanding of the demographic of vaccine recipients.

Analysis

Vaccination Visualization Across WI

Urbanization Rate

Note: The red dot we marked is the position of the capital Madison.

Places that are more urbanized such as Madison, Milwaukee, and Green Bay, tend to have a higher rate of vaccinations.

Among the urban cities, Madison has the lightest color, which means the vaccination rate in Madison is the highest. One possible reason to account for this is that the campus of the University of Wisconsin-Madison is at Madison, therefore, the education level of people there is also higher. Thus, the vaccination rate there would be higher.

Interestingly, there are smaller counties with less urbanization rate (e.g. Door and Bayfield, which are small yellow dots in the graph) that are particularly highly vaccinated versus other, larger counties, so there might be other factors responsible for such a relative vaccination rate.

Level of Education

Places that are more urbanized tend to have a higher level of education.

Among communities with a higher mean education level, the vaccination rate tends to be higher. This can also explain why the places like Sister Bay and Moquah (in the East and North respectively) are with high vaccination rate, despite being in rural counties.

Age

As we analyzed the data between vaccination rate and urban percent, education level, we found out that some of the locations may have a conflict with our assertion.

Places such as Door and Bayfield have an abnormally high vaccination rate where their education level and percent urban are not as high as big cities. We recognized that different counties have different age group distribution.

Especially, for Door and Bayfield, there are a particularly high number of retired people. We made an inference that the high average age in these counties could be correlated in a meaningful way to the high relative vaccination rate.

From the graph, these counties show higher median ages than other counties As we did a further investigation, we found out that the government’s policy of distribution vaccine has relation with the age. According to the CDC’s COVID-19 Vaccine Rollout Recommendations, there are three phases of distributing vaccines. The second phase who are eligible to get the vaccine are people aged 75 years and older where the time that the data was collected. This might be a direct reason why these places have an abnormally high vaccination rate.

Places that are more urbanized tend to have a higher rate of vaccinations. Among communities with a higher mean education level, the vaccination rate tends to be higher. Places with more elderly people tend to have higher vaccination rates.

Education Level and Vaccination

In the above graph, we performed a regression on how the vaccine rate and education level are correlated with each other. For both genders and both education levels, it seems that there is a strong and positive linear relationship.

We will then do a residual plot to prove whether this correlation is statistically significant.

Residual Analysis

We see no obvious patterns from the residual plots, as the points are distributed uniformly at random.

In the above graph, we performed a regression on how the vaccine rate and education level are correlated with each other. For both genders, it seems that there is a strong and positive linear relationship.

For the linear model of one_dose_at_least_p on pct_m_bach, the estimated intercept is 0.25646 and the estimated slope is 0.36844, and the p-value of each test is extremely small, suggesting these estimates are statistically significant.

For the linear model of one_dose_at_least_p on pct_f_bach, the estimated intercept is 0.23749 and the estimated slope is 0.23749, and the p-value of each test is extremely small, suggesting these estimates are extremely statistically significant.

There is a statistically significant linear relationship between education rate and vaccination rates by county.

Income and Education Level

The two graphs above show that with a higher average education level, populations tend to have a higher average income.

Thus, we can form an indirect relationship between higher levels of vaccine adoption rate and higher levels of income.

Interestingly, although the explicit cost of the vaccine is nothing, there is an implicit time cost that could affect households in more essential or time-based jobs, or those households are deterred by longer wait times.

In the following part, we would do a test to verify there is a positive correlation between these two factors.

The following is the plot:

There are some patterns in the data suggesting a linear model is perhaps not the most appropriate. Particularly, the residuals < $30,000 tend to be positive. Additionally, there is a large clustering where the normal distribution of income is strongest (~$45,000-$55,000). We will, however, continue with the fairly reasonable linear model.

The p-value of a Pearson’s product-moment test for correlation between one_dose_at_least_p and income is 1.168e-06, which is less than the significance level alpha of 0.05. We can conclude that vaccination rate and income are significantly correlated with a correlation coefficient of 0.178525.

Normality of Vaccine Distribution Proportions

As an aside, we can observe that the distributions of the dose distribution proportions across zip codes are approximately normal, giving us greater confidence in our p-values.

Discussion

Broader interpretations of analysis

A possible explanation for the positive correlation for urban environments could stem from people not having to travel as far to receive the vaccine due to the population density being higher.

A possible explanation for the positive correlation for income could stem from people having a better understanding of vaccines and discern misinformation about the inefficacy or risks associated with vaccines.

A possible explanation surrounding age is the general roll-out strategy of the state for more elderly and at-risk populations to first receive the vaccine, as well as those populations having more of a risk if they are not vaccinated.

Potential shortcoming

  • The WI datasets may not be fully representative of all cases nationally or globally.
  • Linear models of two variables are simplistic and do not offer the most holistic study review analysis.
  • Vaccination rate data is not fully complete and are only point estimates at the time of data retrieval, instead of time-based data.

Potential future directions for additional work

  • Since the vaccination process is still in progress, we can wait for a few months and redo the study. Try to prove that what we did is correct.
  • As we only researched in Wisconsin, it might not be true for other places in the world. We can collect more data from other places to verify our conclusion.

Primary conclusions and the primary evidence

From the beginning of the pandemic in early 2020, our lives have been meaningfully interrupted and negatively impacted. With CDC guidance to curb spread by widespread vaccination and ultimately herd immunity, we seek to analyze the effects of demographic factors on populations’ willingness and attitude toward a COVID vaccine. Despite the government providing low or no-cost vaccines in Wisconsin, we still observe some citizens rejecting getting vaccinated because of associated doubts of the effectiveness of and fears associated with a COVID vaccine.

We have found a meaningful positive correlation between higher levels of education and vaccine adoption rates as well as higher income levels and vaccine adoption rates.

Further Thoughts and Broader Implications

The research leads us to ponder whether the effect of people’s awareness of the benefits of vaccines would translate to a higher willingness to get vaccinated. We also ponder the correlation between the perceived efficacy of the vaccine and the vaccination rate.

References


  1. Facebook COVID-19 Information Center↩︎

  2. Instagram COVID-19 Information Center↩︎

  3. DHS Vaccine Data↩︎

  4. Homework 6↩︎

  5. ZipCodesToGo↩︎

  6. Madison.com↩︎